Controlling Secondary Structures of Bio-Polymers with Hydrogen-Like Bonding 
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We present results for a lattice model of bio-polymers where the type of /3-sheet formation can 
be controlled by different types of hydrogen bonds depending on the relative orientation of close 
segments of the polymer. Tuning these different interaction strengths leads to low-temperature 
structures with different types of orientational order. We perform simulations of this model and so 
present the phase diagram, ascertaining the nature of the phases and the order of the transitions 
between these phases. 
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I. INTRODUCTION 

The transition of a flexible macromolecular chain from 
a random-coil conformation to a globular compact form, 
called coil-globule transition, has been a subject of exten- 
sive theoretical and experimental studies Generally, 
polymers in a good solvent are modelled by random walks 
with short-range repulsion (excluded volume). Polymers 
undergoing a coil-globule transition are then modelled 
by adding an additional short-range attraction. This 
short-range attraction is both due to an affinity between 
monomers and solvent molecules, affecting the solvabil- 
ity of a polymer, and also due to intra-molccular inter- 
actions between different monomers, for example due to 
van-der-Waals forces. The canonical lattice model 0, Q 
for this transition is given by interacting self-avoiding 
walks, in which self-avoiding random walks on a lat- 
tice are weighted according to the number of nearest- 
neighbour contacts (non-consecutively visited nearest- 
neighbour lattice sites). 

It is recognised that in biological systems, e.g. pro- 
teins, the most relevant contribution to monomer- 
monomer-interactions is due to hydrogen bonds. These 
hydrogen bonds can only form if neighbouring segments 
are aligned in a certain way, resulting in an interaction 
that is strongly dependent on the relative orientation of 
segments. This type interaction plays a leading role in 
the formation of secondary protein structures such as a- 
helices and /3-sheets Q. In this paper, we introduce a 
model for controlling the type of orientational order of 
these structures. 

In [j| , Bascle et al introduced a lattice model of poly- 
mers interacting via hydrogen bonds, in which hydro- 
gen bonds were mimicked by an interaction between 
two nearest-neighbour lattice sites which belong to two 
straight segments of the polymer. This was treated in 
the context of Hamiltonian walks in a mean-field ap- 
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proach, and they predicted a first-order transition be- 
tween an anisotropic ordered phase and a molten phase. 
Later, Foster and Seno introduced this type of interac- 
tion to a model of self-avoiding walks [|| . They analyzed 
it using transfer-matrix techniques in two dimensions, 
where a first-order transition between a folded polymer 
crystal and a swollen coil was found. Subsequently, a 
variant of this model was introduced by Buzano and 
Pretti Q ; where the interaction is defined between paral- 
lel nearest-neighbour bonds, independent of the straight- 
ness required in [f| , arguing that these should better take 
into account the contribution of fluctuating bonds, which 
may be formed even in relatively disordered configura- 
tions. The authors studied this interacting-bond model 
and the one introduced by Foster and Seno on the square 
and simple cubic lattice using the Bethe approximation. 
They found a first-order transition in the Foster-Scno 
model in two and three dimensions, confirming and ex- 
tending results in Q. In contrast to this, they found two 
transitions in the interacting-bond model, a second-order 
^-transition from a swollen coil to a collapsed molten 
globule and then a first-order transition to a folded poly- 
mer crystal. In a later paper Q, they introduced a com- 
peting isotropic interaction and studied its effect in three 
dimensions using the Bethe approximation. They found 
a phase diagram with three different phases (swollen coil, 
collapsed molten globule, folded polymer crystal), similar 
to that of collapsing semi-stiff polymers Q. 

In this work we generalize the Foster-Seno model to 
distinguish between nearest-neighbour contacts of paral- 
lel and orthogonal straight segments (see Figure [l} and 
assign interactions of different strengths to these two 
types of contacts, investigating it with Monte-Carlo sim- 
ulations using the the FlatPERM algorithm We 
begin by simulating the Foster-Seno model and confirm 
the theoretical picture presented above @, Q . We then 
consider our extended model (in three dimensions). We 
find evidence for two differently structured folded phases, 
depending on whether the parallel or orthogonal interac- 
tions dominate. The transition between the swollen coil 
and each of the two collapsed ordered crystals is first- 
order. We investigate the structure of these two low- 
temperature phases. For strong parallel interactions long 



2 



segments of the polymer align, whereas for strong orthog- 
onal interactions the polymer forms alternating orthogo- 
nally layered /3-sheets. 

II. MODEL AND SIMULATIONS 

A polymer is modelled as an n-step self-avoiding walk 
on the simple cubic lattice with interactions — e p and — e 
for nearest-neighbour contacts between parallel and or- 
thogonal straight segments of the walk, as shown in Fig- 
ured Here, a segment is defined as a site along with the 
two adjoining bonds visited by the walk, and we say that 
a segment is straight if these two bonds are aligned. The 
restriction of this model to e = s p is the simple gener- 
alisation of the Foster-Seno model, which was originally 
defined on a square lattice, to three dimensions. 




parallel (p) orthogonal (o) 



FIG. 1: The two types of nearest-neighbour contacts between 
two straight segments of the polymer: parallel segments (left) 
with interaction — e p , and orthogonal segments (right) with 
interaction — e . In two dimensions, only parallel interactions 
are possible. 

The total energy for a polymer configuration (p n with 
n + 1 monomers (occupied lattice sites) is given by 

E n (tp„) = -m p (tp n )e p - m (ip n )e (1) 

depending on the number of non-consecutive parallel and 
orthogonal straight nearest-neighbour segments m p and 
m Q , respectively, along the polymer. For convenience, we 
define 

P P = Pe p and (3 Q = f3e , (2) 

where /3 = 1/ksT for temperature T and Boltzmann 
constant ks ■ The partition function is given by 

Z n (P p ,(3 )= C n , mp , mo e^ m ? + P° m ° (3) 

m p ,m 

with C„. mp . mo being the density of states. We have sim- 
ulated this model using the FlatPERM algorithm [lol ]. 
The power of this algorithm is the ability to sample 
the density of states uniformly with respect to a chosen 
parametrisation, so that the whole parameter range is 
accessible from one simulation. In practice, we have also 
performed multiple independent simulations to further 
reduce errors. The natural parameters for this problem 



are m p and m , and the algorithm directly estimates the 
density of states C„. mpi7Tio for all n < n max for some fixed 
n-max and all possible values of m p and m . Canonical 
averages are performed with respect to this density of 
states. As we need to store the full density of states, 
we only perform simulations up to a maximal length of 
n-max = 128, due to a memory requirement growing as n 3 . 
To reduce the error, we have taken averages of ten inde- 
pendent runs each. Each run has taken approximately 3 
months on a 2.8GHz PC to complete. 

Fixing one of the parameters (3 P and f3 Q reduces the size 
the histogram, and enables us to perform simulations of 
larger systems, as the memory requirement now grows 
as n 2 . Fixing /? Q , say, the algorithm directly estimates a 
partially summed density of states 

Cn,m p (/?o) — ^ Cn,m p .m ^ ■ (4) 

In this way, we can simulate lengths up to n max = 1024 
at fixed (3 . In a similar fashion, we also consider the 
diagonal (3 p = fi Q = f3, which is equivalent to considering 
the partially summed density of states 

Cn.m ^ ^ Cn,m p ,m ■ (5) 

m -\-m p — m 

To reduce the error for our runs up to n = 1024, we have 
taken averages of ten independent runs each. Each run 
has taken approximately 2 months on a 2.8GHz PC to 
complete. 

III. RESULTS 

Before presenting the findings for our model, we briefly 
discuss the results of simulations of the Foster-Seno 
model in two dimensions. We find a first-order transi- 
tion between a swollen coil and ordered collapsed phase 
in agreement with Foster and Seno @ . Figure [5] shows 
the internal density distribution at = ft c = 1.04, 




FIG. 2: Internal energy density distributions for the two- 
dimensional Foster-Seno model at the value of /3 for which 
the fluctuations are maximal, length 1024. 
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where the specific heat is maximal. This distribution is 
clearly bimodal, and finite-size scaling supports the con- 
clusion that the transition is first-order. Our estimate of 
(3 C = 1.04 is close to the value 1.00(2) obtained by Foster 
and Seno Q from transfer matrix calculations. The low 
temperature phase is an ordered /3-sheet type phase. 

For the three-dimensional model, we have explored the 
full two- variable phase space (/3 p ,/3 ) by using a two- 
parameter FlatPERM simulation of the model for lengths 
up to 128. We performed 10 independent simulations to 
ensure convergence and understand the size of the sta- 
tistical error in our results. As in previous work [111, [l^] , 
we found the use of the largest eigenvalue of the matrix 
of second derivatives of the free energy with respect to 
the parameters f3 p and f3 a most advantageous to show 
the fluctuations in a unified manner. Figure [3] displays a 
density plot of the size of fluctuations for < f3 p , (3 Q < 2. 
It suggests the presence of three thermodynamic phases 
separated by three phase transition lines meeting at a sin- 
gle point. For small values of (3 P and /? , we expect the 
model to be in the excluded volume universality class of 
swollen polymers, since at (3 P = (3 Q = the model reduces 
to the simple self-avoiding walk. The question arises as 
to the nature of the phases for large (3 P with (3 a fixed and 
for large (5 with f3 p small, and the type of transitions 
between each of the phases. 
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FIG. 3: This is a density plot of the logarithm of the largest 
eigenvalue of the matrix of second derivatives of the free en- 
ergy with respect to /3 P and /3 at n = 128. The lighter the 
shade the larger the value. 

We find evidence for a strong collapse phase transition 
when increasing f3 p for fixed (3 % 1.38. Corrections to 
scaling at lengths n < 128 make it difficult to identify the 
nature of the transition. The location of the transition 
seems independent of the value of (3 and is located at 
(3 p « 1.25 for length n = 128; this is taken from the 
location of the peak of the fluctuations. Since our data 
indicate that this transition occurs for (3 a < 1.38 at (3 P ss 
1.25, it follows that the diagonal line (3 = (3 P crosses this 
transition line. Configurations in the collapsed phase are 
rich in parallel contacts; we shall discuss further details 
of the collapsed phase below. 



The situation changes significantly for (3 Q > 1.38. 
When we start from the swollen phase at fixed j3 > 1.38 
and increase (3 P we see evidence for a strong phase tran- 
sition to a different collapsed phase, in which orthogonal 
contacts are expected to play an important role. Further 
increase of (3 P leads to another strong transition to the 
parallel-contact rich phase. We investigate the transition 
between the swollen coil and the orthogonal-contact rich 
phase by considering the line f3 p = 1.0. Figure 2] shows a 
bimodal internal energy distribution at the maximum of 
the fluctuations in m for length n = 128, indicating the 
presence of a first-order transition. 

P p =1.0 
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FIG. 4: A plot of the internal energy distribution in m at 
/3 P — 1.0 for length n = 128 at values of f3 for which the 
fluctuations in m are maximal. 

Combining the evidence above, we conjecture the 
phase diagram shown in Figure El having three phases 
and three transition lines that meet at a triple point lo- 
cated at (/3*,/3*) w (1.25,1.38) for length n = 128. By 
considering the location of this point for different lengths 
n, we conclude that its estimate is affected by strong 
finite-size corrections to scaling. 

To further elucidate the nature of the phase transitions 
and the structure of the low-temperature phases, we per- 
form simulations for larger system sizes for the two lines 
(3 = and (3 = (3 p , using one-parameter FlatPERM 
simulations for lengths up to n — 1024, averaged over 
ten independent simulations each. We begin by consid- 
ering p o = 0. The peak of the specific heat occurs at 
(3 P = 0.996 for n — 1024, which we note is shifted away 
from the value at length n = 128 and reflects the presence 
of strong corrections to scaling. The distribution of m p 
at this point is shown in Figure [6l we observe a clear bi- 
modal distribution with well-separated peaks and which 
ranges over fourteen orders of magnitude, convincingly 
supporting the conclusion of a first-order phase transi- 
tion. Similarly, along the line (3 — (3 a — (3 p we find a 
single peak of the specific heat, located at j3 p = 0.998 
for n = 1024. The distribution of m = m + m p at this 
point displays the same characteristics as the transition 
on the line j3 = described above. Our investigations 
of the transition between the two collapsed phases were 
not conclusive, as it is difficult to do simulations at very 
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FIG. 5: This figure represents our conjectured schematic 
phase diagram. The phases boundaries are marked by black 
lines. The dotted-dashed (blue) and dashed (red) lines de- 
note the lines along which we have performed one-parameter 
simulations. 

low temperatures. While we expect there to be a first 
order phase transition between the two collapsed phase 
we were unable to verify this. 
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FIG. 6: Internal energy density distributions of m p at j3 — 
and (3 P = 0.996 for length 1024. 

To delineate the nature of the two collapsed phases, 
we have randomly sampled typical configuration for each: 



two of these are shown in Figure [7] In each case we have 
used f3 p > 0, where parallel contacts are attractive. For 
large /3 P , we have a parallel contact rich phase, and typ- 
ical configurations have lines of monomers arranged in 
parallel. In Figure [71 there is a typical configuration for 
(}3 p = 1.8, (3 = 1.0), which demonstrates these parallel 
/3-lincs. For large /3 , orthogonal contacts play an impor- 
tant role. A typical configuration for (/3 p = 1.3, j3 = 1.9) 
consists of parallel lines arranged in /3-shccts, which arc 
layered orthogonally. The entropy of the phase consist- 
ing out of orthogonal /?-sheets is lower than the entropy 
of the phase consisting out of collection of parallel lines, 
which explains why the collapse-collapse transition line 
is shifted away from the diagonal. Clearly the formation 
of /3-sheets is dependent on (3 p being positive (attractive 
parallel) interactions. 




FIG. 7: Typical configurations for the two different collapsed 
phases, sampled at (/3 P = 1.3, (3 — 1.9) (left) and at (/3 P = 
1.8, Po = 1.0) (right). 

In conclusion, we have demonstrated the intriguing 
possibility of building a secondary structure in proteins, 
which involves layered /3-sheets structures interacting in 
two different ways. Depending on the modelling of the 
interactions we distinguish /3-shects that align parallel or 
orthogonal to each other, which leads to two different 
phases, and secondary structures. There remains an in- 
teresting theoretical question as to the behaviour of the 
system when the parallel interactions are repulsive but 
the orthogonal interactions are highly attractive. 
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